Real-Time Control of an Articulatory-Based Speech Synthesizer for Brain Computer Interfaces
Abstract
Restoring natural speech in paralyzed and aphasic people could be achieved using a Brain-Computer Interface (BCI) controlling a speech synthesizer in real-time. To reach this goal, a prerequisite is to develop a speech synthesizer producing intelligible speech in real-time with a reasonable number of control parameters. We present here an articulatory-based speech synthesizer that can be controlled in real-time for future BCI applications. This synthesizer converts movements of the main speech articulators (tongue, jaw, velum, and lips) into intelligible speech. The articulatory-to-acoustic mapping is performed using a deep neural network (DNN) trained on electromagnetic articulography (EMA) data recorded on a reference speaker synchronously with the produced speech signal. This DNN is then used in both offline and online modes to map the positions of sensors glued on different speech articulators into acoustic parameters that are further converted into an audio signal using a vocoder. In offline mode, highly intelligible speech could be obtained, as assessed by a perceptual evaluation performed by 12 listeners. Then, to anticipate future BCI applications, we further assessed the real-time control of the synthesizer by both the reference speaker and new speakers, in a closed-loop paradigm using EMA data recorded in real time. A short calibration period was used to compensate for differences in sensor positions and articulatory differences between new speakers and the reference speaker. We found that real-time synthesis of vowels and consonants was possible with good intelligibility. In conclusion, these results pave the way for future speech BCI applications using such an articulatory-based speech synthesizer.
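The pipeline described above (EMA sensor coordinates → DNN → vocoder parameters) can be sketched as a small feedforward network. This is a minimal illustrative sketch only: the layer sizes, the choice of 18 EMA features (9 midsagittal sensors × 2 coordinates) and 25 output acoustic parameters are assumptions for illustration, not values taken from the paper, and the network here is untrained.

```python
import numpy as np

# Assumed dimensions (illustrative, not from the paper):
# 9 EMA sensors x 2 midsagittal coordinates -> 25 vocoder parameters.
N_EMA_FEATURES = 18
N_ACOUSTIC_PARAMS = 25
HIDDEN = 128

rng = np.random.default_rng(0)

def init_layer(n_in, n_out):
    """He-initialized weights and zero biases for one dense layer."""
    return rng.normal(0.0, np.sqrt(2.0 / n_in), (n_in, n_out)), np.zeros(n_out)

W1, b1 = init_layer(N_EMA_FEATURES, HIDDEN)
W2, b2 = init_layer(HIDDEN, HIDDEN)
W3, b3 = init_layer(HIDDEN, N_ACOUSTIC_PARAMS)

def articulatory_to_acoustic(ema_frame):
    """Map one frame of EMA sensor coordinates to acoustic parameters.

    In a real system this runs once per analysis frame and its output
    feeds a vocoder that renders the audio signal.
    """
    h = np.maximum(ema_frame @ W1 + b1, 0.0)  # ReLU hidden layer 1
    h = np.maximum(h @ W2 + b2, 0.0)          # ReLU hidden layer 2
    return h @ W3 + b3                        # linear output layer

frame = rng.normal(size=N_EMA_FEATURES)       # one example EMA frame
params = articulatory_to_acoustic(frame)
print(params.shape)  # (25,)
```

In the actual study the network is trained on parallel EMA/speech recordings from the reference speaker; the same frame-by-frame mapping is then reusable in both offline and closed-loop online modes.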
Similar articles
Robust articulatory speech synthesis using deep neural networks for BCI applications
Brain-Computer Interfaces (BCIs) usually propose typing strategies to restore communication for paralyzed and aphasic people. A more natural way would be to use a speech BCI directly controlling a speech synthesizer. Toward this goal, a prerequisite is the development of a synthesizer that should i) produce intelligible speech, ii) run in real time, iii) depend on as few parameters as possible, and ...
Real-time control of a DNN-based articulatory synthesizer for silent speech conversion: a pilot study
This article presents a pilot study on the real-time control of an articulatory synthesizer based on deep neural network (DNN), in the context of silent speech interface. The underlying hypothesis is that a silent speaker could benefit from real-time audio feedback to regulate his/her own production. In this study, we use 3D electromagnetic-articulography (EMA) to capture speech articulation, a...
The Organization of a Neurocomputational Control Model for Articulatory Speech Synthesis
The organization of a computational control model of articulatory speech synthesis is outlined in this paper. The model is based on general principles of neurophysiology and cognitive psychology. Thus it is based on such neural control circuits, neural maps and mappings as are hypothesized to exist in the human brain, and the model is based on learning or training mechanisms similar to those oc...
Session 2aSC: Linking Perception and Production (Poster Session) 2aSC55. Speech sensorimotor learning through a virtual vocal tract
Studies of speech sensorimotor learning often manipulate auditory feedback by modifying isolated acoustic parameters such as formant frequency or fundamental frequency using near real-time resynthesis of a participant's speech. An alternative approach is to engage a participant in a total remapping of the sensorimotor working space using a virtual vocal tract. To support this approach for study...
Interdisciplinary Approaches for Advancing Articulatory Speech Theory and Synthesis
Articulatory synthesis research has long been dominated by frequency-domain and concatenative sample-based speech synthesis techniques. While successful in some domains (e.g., voice-based databases), these techniques still cannot produce natural-looking and natural-sounding speech from text for an arbitrary speaker. Natural-looking and natural-sounding speech technology is one of the next major milestones in voic...
Journal:
Volume 12, Issue
Pages: -
Publication date: 2016